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A myth of infallibility has grown up around the so-called technique of 
"voiceprint identification" as a result of widespread publicity given to the 
sanguine but unsubstantiated claims of its proponents, especially L. G. j 

Kersta.^ A method of identifying people from recordings of their voices would | 

be a most useful tool in law enforcement, counterespionage, and many other 
applications. However, the hypothesis that because each individual’s vocal | 

cavities are different, his acoustic output must likewise be unique, is en- j 

tirely unproven. The artfully nurtured analogy with fingerprints is grossly 
misleading: intra-^ variance of palmar ridge conformations is practically nil 

from birth to death; large intra-rS variance of vocal tract shape and acoustic 

output is a sine qua non of oral communication. | 
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Vanderslice 

Attempts to replicate Mr. Kersta's earlier quasi-formal and inadequately 
reported*^* experiments (Stevens, Williams, Carbonell, & Woods, 1968; Young & 
Campbell, 1967) have obtained very much higher error rates, but do not thereby^ 
directly refute his claim to a personal record of 100% correct identifications. 

As with phrenology a century earlier, "voiceprint" advocates dismiss any counter- 
evidence as irrelevent on grounds the dissenting experimenters are non-initiates . 
Kersta's ostensible infallibility can be directly tested only if he agree.s to 

demonstrate it by taking a test prepared by an independent phonetics laboratory, 

7 8 ,, 

in which sophisticated vocal disguises and competent mimicry, as well as 
voices chosen to be naturally similar, are employed. Kersta's resolute refusal 
to accept such a challenge,^ coupled with the inability of Os in properly con- 
trolled studies to get comparably low error rates, is reason enough to be 
skeptical of his claims. 

Meanwhile one relevant direction for independent research is to assess the 
likelihood that voice spectrograms could ever form a valid basis for absolute 
identification, by examining the fundamental issue of intra-^ similarity in re- 
lation to inter-^ differences. "The 'Voiceprint' Game" (Vanderslice, 1966) 
showed that, among only three ^s, whose voices sounded nothing alike, and who 

made no attempt at vocal disguises 

(1) A given speaker, saying the same word on different 
occasions, often produced widely different spectro- 
graphlc patterns; and 

(2) Different speakers often produced spectrographic 
patterns which were substantially identical. 

It goes without saying that no comparable demonstration has ever been arranged 
with handwriting or fingerprints~or other sorts of admissible evidence (rela- 
tive to their probative functions), e.g. bullets, tire tracks, blood tests, or 

. > 10 
typewriting. 

Nevertheless, "The 'Voiceprint' Game" has been impugned (in the brief of 
plaintiff-petitioner on appeal in State of New Jersey v. Paul Gro don Cary) on 
the grounds that (a) it was "deliberately devised to confuse [i.e. convince] 
the observer [p. 62]" (true — this of course was its raison d'dtre ); (b) "the 
spectrograms used were carefully selected for this [persuasive] purpose [p. 62] 
(true but irrelevant since Kersta's "technique" is open to precisely the same 
charge); and (c) "the clarity of the spectrograms .. .was somewhat blurred by a 
copying prOv^ess [p. 63]" (true— — multilithed copies of the spectrograms had to 
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Vanderslice 3 

be cut from a copy of the "Game" for use in the New Jersey case because the 
originals had been impounded as a defense exhibit in The People v. Edward Lee 
King, Jr . in J*os Angeles. 



New Evidence 

The following experiment was carried out in order to overcome (without 
granting the relevance of). these objections, and to see whether the results ob- 
tained in the '^Game" would be replicable under even more stringent constraints. 

Procedure . As before, three adult male ^s who were native speakers of North 
Midland American English were chosen without further regard for vocal similarity. 

As before, their voices were recorded on an Ampex tape recorder with a high 
quality microphone located in a sound-treated room, and the broadband spectro- 
grams were made on a Kay Electric Sona-Graph (this time a model 6061-B) following 
the same procedures and with approximately the same frequency range. In order to 
overcome the objection to what freedom there was in preparation of "The *Vji^ice- 
print* Game" to select and arrange the spectrograms in a way favoring the points 
to be made (intra-^ variance is large; inter-^ similarity is common), this time 
only one paradigm set of three tokens of one 2-sec. test utterance was recorded 
from each One ^ later returned and recorded several additional tokens, with 
and without attempts at vocal dlgulse, but these were never used in lieu of his 
initial set of three. 

The aim of these paradigm sets was to Illustrate the effects of trivial ' 
changes of. bodily posture in increasing the inevitable intra-^ variance — even 
where all three tokens were recorded within moments, eliminating the substantial 
variations occurring at diurnal or slower rates. The bodily postures were as 
follows: each ^ spoke the first token sitting upright with head erect , the sec- 

ond leaning forward with head lifted , and the third sitting sideways with head 
turned toward the microphone. It should be noted that the auditory effect of 
these cha,nges is practically undetectable. 

Results . Figures 1 through 4 present snipped-up spectrogram segements show- 
ing single words or short phrases— ^exactly as are used in the practice. 0f forensic 
sound spectrography-“arranged in, matrix 'format, j Opposite'" each figure in- an ' identi- 
fying key with the same tabular layout. In every case the leftmost three columns 
contain ^*s paradigm sets, one column per speaker; and, within those columnS|p the 
rows are by head position; top row, erect ; middle row, raised ; and lower row, 
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Vanderslice 

turned. The fourth column .from the left always contains other undigulsed tokens 
from speaker L (whose paradigm set is in column three) and the fifth and sixth 
columns, when present, show disguised tokens from the same S. Thus for the 
first three columns of each figure there were zero degrees of freedom in the 
selection and arrangement of the spectrograms. Once again it is evident that: 

1. Most of the similarity betwen any two spectrograms 
of the same word is linguistic , arising from that 
particular word’s being spoken in the same or a 
similar dialect and context, irrespective of 



whether one or two speakers were involved. 

2. Differences between the spectrograms of the same 
word spoken by two different speakers of similar 



dialect are small in comparison to within-speaker 
variations for that word, even if the speakers are 
easy to distinguish by ear. 



Insert Figures 1 through 4 about here 

It can further be observed (from the rightmost column in Figure 1 and the 
rightmost two in Figure 2) that while the vocal disguises employed here varied 
in their effectiveness, some of them were very effective indeed. A description 
of these disguise modes is beyond the scope of this paper and must await research 
in progress on "some parameters of vocal disguise." In general, distortions of 
tract and timing features are more efficacious than of source features (falsetto, 
whisper, ingressive voice), as could be predicted from a knowledge of the spec- 



trographic process. 



Conclusion 

Already, several persons have been sentenced to prison with "voiceprints 

as the primary evidence of their guilt. Some may well have been Innocent. 

There is no question but that any bona fide advances ini the technology of acoustic 

phonetics should be made available, as applicable, to the legitim a t e purposes 

of the police. However, "voiceprint identification" is to be likened not to 

fingerprints but rather to polygraph or "lie detector" tests (cf. Smith, 1967). 

Neither method is accepted by scientists in the relevant disciplines as reliab ly 

even in the technical sense of the word-far less its popular sense of "valid'." 
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Vander slice ^ 

Both employ large, complicated electronic devices likely to awe the lay juryman — 
or, still less legitimately, to intimidate suspects into confessing out of fear 
of a machine which in fact (so far an enabling the identification of people by 
their voices or the detection of untruths, respectively) is a hoax. The "Voice- 
printer" (spectrograph) and the "lie detector" (polygraph) are precisely alike 
in that stable guidelines or standards for interpreting their graphic records, 
to negotiate the respective binary cruces (same or -different voices; true or 
false responses) and arrive at valid (correct) and reliable (repeatable) deci- 
sions, appear permanently out of reach. Admission of such evidence into courfs 
of law as having probative value makes a mockery of that revered jurisprudential 
principle, presumption of innocence. 
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Footnotes 

^This paper is based in part on a talk entitled "The ’Voiceprint’ Hoax" 
given at the Graduate Center of the City University of New York in December, 

1968, and at the State University of New York at Buffalo in February, 1969. 

The research reported herein was performed in part pursuant to Contract OEC-3" 
6-061784-0508 with the U. S. Department of Health, Education, and Welfare, 

Office oif Education, under provisions of P. L. 83-531, Cooperative Research, and 
the provisions of Title VI, P. L. 85-864, as amended. This research report is 
one of several which have been submitted to the Office of Education as Studies 
in Language and Language Behavior , Progress Report No. VIII, February 1, 1969. 

^Mr. Kersta is president and founder of Voiceprint Laboratories, Inc. (now 
a division of Farrington Manufacturing Co.) which he established upon leaving 
the Bell Telephone Laboratories (c. 1965) after being a member of the technical 
staff for 39 years. Exaggerated claims for "voiceprint" accuracy were pub- 
licized as early as 1962 under the aegis of Bell Labs in a campaign calculated 
to discourage obscene and other nuisance phone calls. A full page paid adver- 
tisement in (inter alia) Physics Today , December 1962, titled News from Bell 
T elephon e Laboratories read: "WE’RE ’FINGERPRINTING’ VOICES .... Voiceprints . . . 

are actual pictures of sound, revealing the patterns of voice energy. Each 
pattern is distinctive and identifiable. They are so distinctive that voice- 
prints may have a place, along with fingerprint and handwriting identification, 

as an important tool of law enforcement. 

"The shape and size of a person’s mouth, throat and nasal cavities cause 
his voice energy to be concentrated into bands of frequencies. The pattern of 
these bands remains essentially the same despite modifications which may result 
from loss of teeth or tonsils, the advancement of age, or attempts to disguise 

the voice." 

Kersta on The Joe Pyne Show (February, 1967) stated that "we know of no 

way a person can change his speech such that it is impossible to identify him. 

It sometimes makes it a little more difficult— it takes longer to do it— but 

we have not found a condition where it has been impossible to find identifying 

features in a person’s voice." Even more egregious claims were made by L. G. 

Kersta Jr. on a BBC broadcast (c. 1966): "People cannot disguise their voices, 

we found this out. We can identify you by your voice. [Each voice is individual] 
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Vandersllce 7 

like your fingerprint.... We've had numerous law enforcement cases: obscene 

telephone calls, bomb threats; even have your mouth full of marbles, we'll still 
Identify you... or disguise your voice In any way, we'll still pick you up. It 
doesn't make any difference at all." But In fact all these grandiose claims are 
based on a few Informal pseudo-experiments with the most naive vocal "disguises" 
Imaginable. 

3 

The analogy with fingerprints. Implicit In the name. Is forwarded In 
various covert ways (besides bald comparisons as cited In Footnote 2) . One Is 
the continual use of contour spectrograms In publicity releases, apparently for 
their exotic look and vague resemblance to fingerprints. Indeed the same six 
contour prints of the word you (or a subset) , In which the two matching ones 
are of Kersta's voice, have been used ever since 1962 (see Figure A-1, Appendix 
A). But In fact these displays are not "volceprlnts" at all In the sense In 
which this term Is used In forensic spectrography , since only wlde-band 

("bar") spectrograms are used for comparisons by eye. 

A second way of promoting the analogy with fingerprints Is the Incorporation 
Into the "volceprlnt" mystique of nomenclature borrowed form the more prestigious 
art. For example, Kersta make sf much of counting up "points of similarity" — 
although he Is unable to define what constitutes one. The closest he has come 
was drawing sketches alongside each pair of spectrograms Introduced by the pro- 
secution In the King trial to aid the uninitiated jurors In seeing the supposed 
similarities. Figures B-1 and B-2, Appendix B, reproduce these "volceprlnts" 
along with Kersta's sketches. Comparison with between-speakers pairs In Figures 
1^4 suggests that his criteria of similarity are rather compliant. 

4 

See Stevens, et al. , 1968, p. 1606n; also Ladefoged»cand Vandersllce, 1967, 
pp. 135-136. 

^They showed also the clear superiority of ear over eye In this task of 
speaker recognition, contrary to Kersta's claim. 

A record which Kersta bases on his "over 100 field applications" whose ac- 
curacy Is attested by the fact that, purportedly, charges were always dropped 
or convictions obtained In accord with his reports. In two crucial cases 
(King & Cary ) , convictions based chiefly on "volceprlnt" evidence have been 
reversed for that reason on appeal. But a definitive decision on "volceprlnt" 
admissibility still hanges in the balance, awaiting further litigation. 
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Vanderslice 8 

^Not such ridiculous procedures as holding the nose, putting marbles in the 
mouth, or a handkerchief over the telephone mouthpiece, etc. (cf. Prince, 1967). 

Q 

Rather than the mere impressions of public figures (e.g., J. F. Kennedy) 
by entertainers (e.g., Vaughan Meador) which Kersta has pitted his "technique" 
against (informally). 

^P. Ladefoged, Professor of Phonetics at UCLA delivered one such challenge when 
he appeared opposite Kersta on nationwide TV ( The Joe Pyne Show , February, 1967). 
Mr. Kersta’ s reply to Dr. Ladefoged was; "I have many more important things to 
do than play kindergarten games with you." 



10 



wTiile it is true that even such widely accepted forms of evidence as finger- 
prints and ballistics require interpretation, yet their probative worth is founded 
on the legal fiction that a jury can "draw its own conclusion from the evidence," 
and that the opinion of experts is merely an "aid and guide" to their delibera- 
tions. It is difficult to see how this could be maintained in the use of "voice- 
prints." 

It is sometimes suggested that even though "voiceprints" are not trustworthy 
as evidence to convict, they might be allowed, like blood tests in paternity 
cases, to exonerate — i.e. prove that two voices could not be from the same per- 
son. The data in Figures 1-4, however, show that intra-^ variations for the 
same word or phrase (particularly if a vocal disguise must be allowed for) are 
so enormous relative to typical inter-^ differences that no such conclusion 
could be warranted. Indeed "voiceprints" admitted for this purpose, albeit 
less invidious to civil liberties and due process, might (if believed by a 
jury) undo months of careful police work and result in the specious exoneration 
of a hardened criminal. Since Kersta claims to have reported negative identi- 
fications in "over 80% of the more than 100 field applications [personal com- 
munication, November, 1967]," this may well be the gravest danger posed by his 
mystique. 

^^One of the more unsavory aspects of the King trial in Los Angeles was the 
patent use of "voiceprints" by the prosecution to circumvent the inadmissibility, 
under the Miranda decision, of an illegally obtained confession tape. Portions 
of this tape, including at least one clear instance of an admission, were sent 
to Kersta as the exemplar of King’s voice (see Ladefoged & Vanderslice, 1967, 
pp. 140-141; also Cohen, 1967). 
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Appendix A 

Figure A-1 shows the hackneyed set of six contour spectrogran^ of the word 
you. which have been printed and reprinted ubiquitously. Thqy are not "voice- 
prints" as that term is applied in law enforcement cases . This set of course 
conveys a quite misleading impression of the intra-S similarity and inter-S 
differences to be expected in random sample of population (cf. Figures 1-4). 



Appendix B 

Figures B-1 and B-2 show the entire body of Evidence Kersta presented in the 
g^n » trial to "prove" that an unknown voice on the audio tape of a CBS pre-TV 
interview and an unknown voice on the audio portiPS of the CBS broadcast video- 
tape. respectively, were both the voice of the defendant. There was no other 
evidence whatever to show that the two unknown ^Voices were of one and the same 
person (whether or not it was King). Thus Kersta' s claim (in a rebuttal to 
" The’Voiceprint? MystiqMe"at a meeting of the Acoustical Society of America. 
Miami Beach. November. 1967 . and :elsewhere) that he has never used fewer than 
thirteen pairs of words in making an identification is not strictly veridical. 
The five pairs in Figure B-1 must be considered the minimum (Ladefoged & 
Vanderslice. 1967) at least until a new low is established. 






Figure Captions 

Fig. 1. Spectrograms of the phrase be out, spoken by Ss J, K. and L. S and 
other pertinent Information, including head position in paradigm sets, noted 

at corresponding position In key. 

Fig. 2. Spectrograms of the word i spoken by ^s J. K. and L. and other 
pertinent information, including head position in paradigm sets, noted at cor- 

responding position In key. 

Fig. 3. Spectrograms of the word door spoken by Ss J. K. and L. S and 
other pertinent Information. Including head position in paradigm sets, noted 
at corresponding position In key. 

Fig. 4. Spectrograms of the phrase know you will spoken by Ss J. K. and 
L. S and other pertinent information, including head position in paradigm 
sets, noted at corresponding, position In key. 

Fig. A-1. Contour spectorgrams of the word you spoken by five Ss. Upper 
left and lower right represent same S (Kersta) . These are not "voiceprints" 
as that term Is used In forensic applications . 
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Fig. B-1. People's exhibit 19 from the MBS <=ase: Kersta s spectrogram , 

with puwtive "points of similarity" sketched, purporting to link an unknown 
voice on the CBS pre-TV interview audio tape (left member of each pair) with 
the known voice of Edward Lee King. Jr. (right members) illicitly taped during 
his interrogation. Reproduced unretouched as labeled and mounted by Kersta. 
except for excision of blank space and slight reduction in size. 

Fig B-2 People’s exhibit 20: Kersta' s spectrograms and sketches purpor- 

ting to’ link an unknown voice on the audio portion of the CBS broadcast video- 
tape (left member of each pair) with the known voice of Edward Lee King. Jr. 
(right member) illicitly taped during his interrogation. Reproduced unretouc e 
as labeled and mounted by Kersta. except for excision of blank space and slight 

reduction in size. 
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Key to Figure 4 
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